Design Issues for Second Generation Stream Processing Engines
Abstract
Borealis is a second-generation distributed stream processing engine that is being developed at Brandeis University, Brown University, and MIT. Borealis inherits core stream processing functionality from Aurora [13] and distribution functionality from Medusa [49]. Borealis modifies and extends both systems in non-trivial and critical ways to provide advanced capabilities that are commonly required by newly-emerging stream processing applications. In this paper, we outline the basic design and functionality of Borealis. Through sample real-world applications, we motivate the need for dynamically revising query results and modifying query specifications. We then describe how Borealis addresses these challenges through an innovative set of features, including revision records, time travel, and control lines. Finally, we present a highly flexible and scalable QoS-based optimization model that operates across server and sensor networks and a new fault-tolerance model with flexible consistency-availability trade-offs.
Similar resources
Linked Stream Data Processing Engines: Facts and Figures
Linked Stream Data, i.e., the RDF data model extended for representing stream data generated from sensors and social network applications, is gaining popularity. This has motivated considerable work on developing corresponding data models associated with processing engines. However, currently implemented engines have not been thoroughly evaluated to assess their capabilities. For reasonable systemati...
Optimization of Continuous Queries in Federated Database and Stream Processing Systems
The constantly increasing number of connected devices and sensors results in increasing volume and velocity of sensor-based streaming data. Traditional approaches for processing high velocity sensor data rely on stream processing engines. However, the increasing complexity of continuous queries executed on top of high velocity data has resulted in growing demand for federated systems composed o...
Re-design of Downstream Processing Techniques for Nanoparticulate Bioproducts
There has been much interest generated in the recovery of nanoparticulate (nanoparticle) bioproducts (the second generation of biotechnological products) such as plasmid DNA and viruses as putative gene therapy vectors, macromolecular assemblies as drug delivery vehicles, and virus-like particles as vaccine components. Such products must be manufactured in advanced stages of purity, ma...
Tutorial: Cloud-based Data Stream Processing
In this tutorial we present the results of recent research about the cloud enablement of data streaming systems. We illustrate, based on both industrial and academic prototypes, emerging use cases and research trends. Specifically, we focus on novel approaches for (1) scalability and (2) fault tolerance in large-scale distributed streaming systems. In general, new fault tolerance me...
PRSP: A Plugin-based Framework for RDF Stream Processing
In this paper, we propose a plugin-based framework for RDF stream processing (PRSP). With this framework, we can apply SPARQL engines to process C-SPARQL queries in a simple way while maintaining the high performance of those engines. Taking advantage of PRSP, we can process large RDF streams in a distributed context via distributed SPARQL engines. Moreover, we can evaluate the performance and c...